Speech Detection and Enhancement Using Single Microphone for Distant Speech Applications in Reverberant Environments
نویسندگان
چکیده
It is well known that in reverberant environments, the human auditory system has the ability to pre-process reverberant signals to compensate for reflections and obtain effective cues for improved recognition. In this study, we propose such a preprocessing technique for combined detection and enhancement of speech using a single microphone in reverberant environments for distant speech applications. The proposed system employs a framework where the target speech is synthesized using continuous auditory masks estimated from sub-band signals. Linear gammatone analysis/synthesis filter banks are used as an auditory model for sub-band processing. The performance of the proposed system is evaluated on the UT-DistantReverb corpus which consists of speech recorded in a reverberant racquetball court (T60 ∼ 9000msec). The current system shows an average improvement of 15% STNR over an existing single-channel dereverberation algorithm and 17% improvement in detecting speech frames over G729B, SOHN & Combo-SAD unsupervised speech activity detectors on actual reverberant and noisy environments.
منابع مشابه
Improve Speech Recognition Performance in Reverberant Environment Based on Estimation of Energy Feature
The purpose of this paper is to improve speech recognition performance in reverberant environment with distant talking, which based on directly speech processing or selected Log-energy feature. Speech recognition performance can be improved by changing the value of Log-energy feature or directly speech energy. Experiments used CENSREC-4 corpus to evaluate distant-talking speech under various re...
متن کاملBayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data
In this contribution we investigate the effectiveness of BAYESIAN feature enhancement (BFE) on a medium-sized recognition task containing real-world recordings of noisy reverberant speech. BFE employs a very coarse model of the acoustic impulse response (AIR) from the source to the microphone, which has been shown to be effective if the speech to be recognized has been generated by artificially...
متن کاملTitle Placeholder
A speech signal captured by a distant microphone is generally contaminated by reverberation and background noise, which severely degrade the automatic speech recognition (ASR) performance. In this paper, we first extend a previously proposed single channel dereverberation algorithm to a multi-channel scenario. The method estimates late reflections using multichannel multi-step linear prediction...
متن کاملMulti-step linear prediction based speech dereverberation in noisy reverberant environment
A speech signal captured by a distant microphone is generally contaminated by reverberation and background noise, which severely degrade the automatic speech recognition (ASR) performance. In this paper, we first extend a previously proposed single channel dereverberation algorithm to a multi-channel scenario. The method estimates late reflections using multichannel multi-step linear prediction...
متن کاملSingle Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique
A speech signal captured by a distant microphone is generally contaminated by reverberation, which severely degrades the audible quality and intelligibility of the observed speech. In this paper, we investigate the single channel dereverberation which has been considered as one of the most challenging tasks. We propose an example-based speech enhancement approach used in combination with non-ex...
متن کامل